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£X| Abstract. We present a relation-algebraic model of Condorcet voting 

t , and, based on it, relation-algebraic solutions of the constructive control 

C^ problem via the removal of voters. We consider two winning conditions, 

^^; viz. to be a Condorcet winner and to be in the (Gilles resp. upward) 

uncovered set. For the first condition the control problem is known to 
OO be NP-hard; for the second condition the NP-hardness of the control 

problem is shown in the paper. All relation-algebraic specifications we 
will develop in the paper immediately can be translated into the pro- 
gramming language of the BDD-based computer system RelView. Our 
approach is very flexible and especially appropriate for prototyping and 
experimentation, and as such very instructive for educational purposes. 
C/3 It can easily be applied to other voting rules and control problems. 

o 



—i 1 Introduction 

> 

Elections have been studied by scientists from different disciplines for more than 
a thousand years. In addition to the obvious moral and political issues, elections 
also give rise to several computational questions, which are studied in the field of 
Computational Social Choice. The most prominent of these questions is the re- 
quirement of an algorithm that efficiently computes the winner(s) of an election. 
^^. Surprisingly, such algorithms do not exist for all natural election systems, see 

,— I [12] for an example. However, elections also give rise to computational problems 

L| which ideally should be hard to solve: 

•i-H 

S^ — The manipulation problem (see [T]) asks to determine a way for a group of 

5— I voters to vote that serves their interest best, even though the vote might not 

represent their true preferences. Unfortunately, classical results show that 
every reasonable voting system gives voters incentives to vote strategically 
in this way (Gibbard-Satterthwaite theorem, cf. [11I17J ). 
— The control problem (see, e.g., [2]) asks for determining a way for the coor- 
dinator of an election to set up the election in a way that serves his or her 
personal interest. In order to achieve this, the coordinator might remove or 
add alternatives or voters from the election or partition the election. 

Following the above-mentioned paper pQ , numerous papers have studied the com- 
plexity of manipulation and control problems for elections (see, e.g., |7|9)13j ). 
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For many election systems, it can be shown that the studied control or ma- 
nipulation problem is NP-hard, and thus the election system is deemed to be 
'secure' against this attempt to influence the outcome of the election. However, 
it has long been observed that efficient algorithms that work for many cases 
can still exist for NP-hard problems, the very successful history of SAT solvers 
being an impressive example. In the context of Computational Social Choice, [6] 
demonstrates a fast and very simple algorithm that works correctly on 'most' 
inputs (according to a suitably chosen probability distribution) and is allowed 
to compute an incorrect result on the remaining inputs. 

In this paper, we study an alternative approach to show that NP-hard elec- 
tion problems may be solvable in practice. We apply the Computer Algebra 
system RelView (see [3|20] ). which uses mathematical tools from relation al- 
gebra in the sense of 18 19J, to implement algorithms for the control problem of 
an election. Our implementations are provably correct for all instances; hence, 
as the problems we study are NP-hard, our algorithms do not run in polynomial 
time in general. Instead, we rely on Rel View's optimization to exploit the sim- 
ple structure of most practical instances of the problems we study, which allows 
for an algorithmic treatment. 

Concretely, we study the following problem: Given an election consisting 
of a set of alternatives (also sometimes called candidates), voters along with 
information on how they will vote, and a prefered alternative a*, determine a 
minimum set Y of voters such that removing all voters in Y makes a* win the 
election. The election system we study is the Condorcet voting system with the 
uncovered set winning condition (in case there is no Condorcet winner). To the 
best of our knowledge, this is the first paper where a relation-algebraic approach 
is used to solve problems related to elections that directly take the individual 
votes into account. An advantage of our approach is that it is very general 
and allows to treat related problems for different election systems with only 
small modifications. In particular, we could also treat elections in a generalized 
setting, where voters' preferences are not linear orders (such a setting is studied 
in Oil]). A further advantage is that the correctness proofs for our algorithms 
are formalized in such a way that, in principle, their automatic verification is 
possible. Our results and the performance of our algorithms demonstrate that 
Computer Algebra tools can be used successfully to solve NP-hard problems, 
where the data structures used in the Computer Algebra package automatically 
allow to exploit the 'easyness' that may be present in practical instances. In our 
case, RelView uses BDDs to efficiently represent relations that are exponential 
in the input size. Thus, relation-algebraic algorithms can be obtained without 
specific knowledge about the problem domain. 



2 Relation-algebraic Preliminaries 

Given sets X and Y, we write R : X <H- Y if R is a (binary) relation with source 
X and target Y, i.e., a subset oi XxY. If the sets of R's type X O Y are finite, 
then we may consider R as a Boolean matrix. Since such an interpretation is 
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well suited for many purposes and also used by RelView as the main possibility 
to visualize relations, in this paper we frequently use matrix terminology and 
notation. Especially, we speak about the entries, rows and columns of a rela- 
tion/matrix and write R x , y instead of (x, y) G R or xRy. We assume the reader 
to be familiar with the basic operations on relations, viz. R T (transposition) , R 
(complement), R U S (union), RnS (intersection) and R;S (composition), the 
predicates iiC5 (inclusion) and R — S (equality), and the special relations 
(empty relation), L (universal relation) and I (identity relation). In case of 0, L 
and I we overload the symbols, i.e., avoid the binding of types to them. 



For R : A <-» Y and S : X^Z,by syq(R, S) = R T S D R T S their symmetric 
quotient syq(R, S) : Y -R- Z is defined. In the present paper we will only use its 
point-wise description, saying that for all y G Y and z G Z it holds syq(R, S) VyZ 
iff for all x G X the relationships R x . y and S XtZ are equivalent. 

In relation algebra vectors are a well-known means to model subsets of a 
given set X. Vectors are relations r : X ++ 1 (we prefer in this context lower case 
letters) with a specific singleton set 1 = {_L} as target. They can be considered 
as Boolean column vectors. To be consonant with the usual notation, we omit 
always the second subscript, i.e., write r x instead of r Xt ±. Then r describes the 
subset Y of X if for all x G X it holds r x iff x G Y. A point p : X o 1 is a 
vector with precisely one 1-entry. Consequently, it describes a singleton subset 
{x} of X and we then say that it describes the element x of X. If r : X -H- 1 is 
a vector and Y the subset of X it describes, then inj(r) : Y ^ X denotes the 
embeddiny relation of Y into X. In Boolean matrix terminology this means that 
inj(r) is obtained from I : X -H- X by deleting all rows which do not correspond 
to an clement of Y and point-wisely this means that for all y G Y and x G X it 
holds inj(r)y tX iff y = x. 

In conjunction with powersets 2 X we will use membership relations M : 
X "R- 2 X and size comparison relations S : 2 X -R- 2 X . Point-wisely they are de- 
fined for all x G X and Y, Z G 2 X as follows: M x> y iff x G Y and Sy,z iff 
\Y\ < |.Z|. A combination of M with embedding relations allows a column-wise 
enumeration of an arbitrary subset 6 of 2 . Namely, if the vector r : 2 •<-> 1 
describes (3 in the sense defined above and we define S — M; inj(r) T , then we get 
I«6as type of S and that for all x G A and Y G 6 it holds S Xi y iff x G Y. In 
the Boolean matrix model this means that the sets of 6 are precisely described 
by the columns of S, if the columns are considered as vectors of typs IhI. 

To model direct products XxY of sets A and Y rclation-algebraically, the 
projection relations -k : Xx7f>X and p : XxY ^Y are the convenient means. 
They are the relational variants of the well-known projection functions and, 
hence, fulfil for all u G XxY , x G A and y G Y the following equivalences: 
tt u . x iff Wi = x and p u , y iff U2 = y- Here ui denotes the first component of u 
and Ui the second component. As a general assumption, in the remainder of the 
paper we always assume a pair u to be of the form u = (u\, 112)- Then u denotes 
the transposed pair (u2,u\). The projection relations enable us to specify the 
well-known pairing operation of functional programming relation-algebraically. 
The pairing of R : Z «• A and S : Z «• Y is defined as [i?, 5] = i?; tt t n S*; p T : 
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Z-^XxY. where ir and p are as above. Point-wisely this definition says that 
[R, SJ Z u iff R z , Ul and S Z<U2 , for all Z € Z and u G XxY . Based on 7r and p we 
are also able to establish a bijective correspondence between the relations of type 
Xf>Y and the vectors of type IxFol. The transformation ofi?:If>7 into 
its corresponding vector vec(R) : XxFolis given by vec(R) = (ir; RHp); L and 
the step back from r : XxY <->l to its corresponding relation rel(r) : Aoy 
by reJ(r) = 7r T ; (p n r;L). Point- wisely this means that for all u € XxF the 
following equivalences are true: vec(i?) M iff R UljU2 and re7(r) Uli „ 2 iff r u . 



3 A Relation-algebraic Model of Condorcet Voting 

Usually, an election consists of a non-empty and finite set N of voters (agents), 
normally N = {1, ... ,71}, a non-empty and finite set A of alternatives (candi- 
dates), the individual preferences (choices, wishes) of the voters and a voting 
rule that aggregates the winners from the individual preferences. A well-known 
voting rule is the Condorcet voting rule. Here it is usually assumed that each 
voter ranks the alternatives from top to bottom, i.e., the individual preferences 
of the voters i G N are expressed via linear strict orders >i :if>A. From them 
the dominance relation C : A ■<-> A is computed that specifies the collective pref- 
erences. An instance of a Condorcet election consists of the sets N, A, and the 
relations >j for all i G N. In the following we consider the approach that C a ,b 
iff the number of voters i with a >j b is (strictly) greater than the number of 
voters i with b >, a. In this case we also say that a beats b with p points, where 
p is the (positive) difference between these numbers. It is known that C may 
contain cycles and that an alternative that dominates all other ones - a so-called 
Condorcet winner - does not necessarily exist. To get around this problem, in 
the literature so-called choice sets have been introduced which take over the role 
of the best alternative and specify the winners (see e.g., [M] for more details). 
In this paper, we will study the choice set Uncovered Set. 

For a relation-algebraic treatment of Condorcet voting, we first model its 
input, i.e., the individual preferences of the voters, accordingly. 

Definition 3.1 The relation P : N -H> A 2 models the instance (N,A,(<i) ie jy) 
of a Condorcet election if Pi fU is equivalent to U\ >i u 2 , for alii G N and u G A 2 . 

In the following RelView picture an input relation P is shown. The labels of 
the rows and columns indicate that the voters are the natural numbers from 1 
to 13 and the alternatives are the eight letters from a to h. 
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It is troublesome to identify from this picture the individual preferences. But 
if we select the single rows, transpose them to obtain vectors of type A 2 f>l 
and apply the function rel of Section [2] to the latter, then RelView depicts the 
individual preferences as Boolean matrices. For the rows 1, 4, 7 and 10 we get, 
in the same order, the following Boolean matrices for >i, >4, >7 and >n: 



Now, the preferences of the single voters are easy to seq^] Voters 1 to 3 rank 
their alternatives from top to bottom as a, c, e, g, b, d, f, h, voters 4 to 6 as 
a, b, c, d, e, /, g, h, voters 7 to 9 as b, a, d, c, /, e, h, g and the remaining voters 
10 to 13 as h, g, f,e,a,b,c,d. The procedure also shows how to construct, in 
general, the input P : N <-> A 2 from strict orders >.; ; if> A by inverting it. We 
have to number the voters from 1 to n, then to transform each relation >; into 
vec(>i) T : 1 f-> A 2 , i.e., the transpose of its corresponding vector, and finally to 
combine the transposed vectors row by row into a Boolean matrix. The latter 
means that we have to form the relation-algebraic sum vec(>i) T + - • --|-vec(> n ) T . 
We won't to go into details with regard to sums of relations and refer to [19 , 
where a relation-algebraic specification via injection relations is given. Instead, 
we demonstrate how to get from the individual preferences relation P the col- 
lective preferences, i.e., the dominance relation C. In what follows, we assume 
the projection relations x, p : A 2 <-> A of the direct product A 2 to be at hand as 
well as the membership relation M : N •<->• 2 N and the size comparison relation 
S : 2 Ar f>2 ,v . Each of these relations is available in RelView via a pre-defined 
function and their BDD-implcmentations are rather small. See |15I16| for details. 

Theorem 3.1 Suppose that P : N -f-> A 2 models an instance of Condorcet vot- 
ing. If we specify relations E, F : A 2 -h- 2^ and C : A <-> A by 

E = syq(P,M) F = syq(P;[p,7r],M) C = rei((£:n F; (S n S^); £), 

then C UliU2 is equivalent to \{i £ N | Pj iU }| > \{i G N | Pi.u}| 7 for all u G A 2 . 

Proof. For the given u £ A 2 we prove in a preparatory step for all Y G 2 N that 

E u ,y <=^ syq(P, M) u .y 

<==>Vi€N :P i}U oi<=Y 
^ {* G N | P,4 = Y. 

Using that the exchange relation [p, x] : A 2 -n- A 2 relates the pair u precisely 
with its transposition u = (u 2 , U\), in a rather similar way we can prove that for 



1 A still more appropriate method is to compute for each relation >< its Hasse diagram 
in the sense of [18] and to draw the latter in RelView as directed graphs. 
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all Z E 2 N the following property holds: 

F u , z <=^{i£N\ P i>a } = Z 
By means of these two auxiliary results, we now conclude the proof as follows: 



L« 



/«1,«2 



• rel((E n F; (S n ST));L), 
■((EnF;(Sn^));L) u 

- 3 Y e 2 N : E u .y A (F; (S n S^ )) u ,y A Ly 
■^£2^: £ u ,y A 3Z e 2 W : F„, z A S z ,y A -.Sy, z 

- 3 7 e 2 W : £„y A3Ze2 w : F M , Z A |Z| < |F| A \Y\ > \Z\ 
-3Y,Ze2 N : {i £ N \ P itU } = Y A{i £ N \ P iA } = Z A\Z\ < \Y\ 

- \{i € iV | P iiU }| > |{* € TV | P iifl }| 



n 



The specihcations of Theorem |3.1| can be executed by means of RelView after a 
straightforward translation into its programming language. In case of the above 
input relation P the tool computed the following dominance relation C. From 
the first row of C we see that alternative a is tha Condorcct winner since it 
dominates all other alternatives. 



I-:-:: 



This relation is not only asymmetric (i.e., satisfies CnC T = 0) but also complete 
(i.e., satisfies I C C U C T ). Altogether, C is a tournament relation and this 
property implies the uniqueness of a Condorcet winner in the case that one 
exists. How to compute, in general, from the dominance relation C the choice 
sets using relation-algebraic means is demonstreted in [4j. 



4 Control of Condorcet Voting by Deleting Voters 

We only consider the constructive variant of the control problem for Condorcet 
voting, where control is done by deleting voters. Usually, the task is formulated as 
a minimization-problem: Given a specific alternative a* , determine a minimum 
set of voters Y such that the removal of Y from the set N of all voters makes a* 
to a winneir] To allow for an easier relation-algebraic representation, we consider 
the dual maximization-problem, i.e., we ask for a maximum set of voters X such 
that a* wins subject to the condition that only voters from X are allowed to 
vote. It is obvious that from X then a desired Y is obtained via Y — N \ X. 

We start with the assumption that 'to win' means 'to be a Condorcet winner'. 
As shown in [5] , Condorcet voting is computationally resistant to our control type 



2 The destructive veriant of our control problem asks for a minimum set of voters the 
removal of which prevents win of a* . 
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in case of this specification of winners. I.e., it is NP-hard to decide, for a* E A 
and fcgNas inputs, whether it is possible to find k voters whose removal makes 
a* to a Condorcet winner. 

As a first step towards a solution of the maximization-problem, we relativize 
the dominance relation C by additionally considering the sets of voters X which 
only are allowed to vote. Concretely this means that we specify a relation R 
that relates X e 2 N with a, 6 G A iff \{i £ X \ a > t b}\ > \{i G X \ b > 4 a}\. 
Since we work with binary relations, we have to combine two of the three objects 
X, a and b to a pair. We do this with a and o, i.e., relate X with u under the 
assumption that u± equals a and ui equals b. Then the following theorem shows 
how the relativized dominance relation R : 2 N o A 2 can be specified relation- 
algebraically. Again we assume the relations n, p : A 2 ■<-> A, M : N <H-2 N and 
S : 2 N <-> 2^ to be at hand. 

Theorem 4.1 Suppose again that P : N O A 2 models an instance of Condorcet 
voting. If we specify relations E, F : 2 N xA 2 «-» 2 N and R : 2 N O A 2 by 

E = Byq([M,P\,M) F = syq([M,P; fan]], M) R = rel((En F; (Sn S T )); £), 

then Rx,u is equivalent to \{i G X | Pi )U }| > |{i G X | P^uIIj /or aH X G 2^ 
and tieA 2 . 

Proof. Assume arbitrary objects Ie2 iv and u G ^4 2 to be given. Then, we have 
for all Y £ 2 N the following equivalence: 

E{x,«),Y*=^-syq([M,P],M)(.x,«),Y 

^V ! eJV:[M,%„ ) f)M l , y 

<^ V* G iV : M i)X A P 4 , u O M,,y 
<^ V* G iV : i G X A P i)U -H- i G Y 

^{!£l P,4 = Y 

In a similae way we can show for all Z G 2 N the following fact, using the property 
of the exchange relation [p, 7r] : A 2 «-» A 2 mentioned in the proof of Theorem 

P(x,«),z ^{*el| P, fi } = z. 

Now, the following calculation shows the claim: 

Rx,u<=^rel((EnF;{Sn ST));L) X ,„ 
^((£nF;(SnST));% ilt) 

<=► 3 Y G 2 W : £(x, u) ,y A (F; (S n S^ ))(x, u ),y A L Y 
^=^3Y e2 N : E {x>u)jY A3Z e2 N : F (x ,u),z A S z ,y A -S Y , Z 
^3F^e2 w :{ ie X|P, U }=YA{«G X | P ijfi } = Z A |Z| < |Y| 

^|{«el| p,„}| > |{t G A | P iA }\ a 



3.1 
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In the second step, we now take the relativized dominance relation R of Theorem 



4.1 and specify with its help a vector cand : 2 N f-> 1 that describes the subset of 
2 N the members of which are the sets X which are candidates for the solution of 
our control problem. The latter property means that a* is a Condorcet winner, 
provided that only voters from X are allowed to vote. From the vector cand 
we then finally compute the vector description sol : 2 N f>l of the maximum 
candidate sets, which are the solutions we are looking for. The next theorem 
shows how to get cand and sol from R and a*. 

Theorem 4.2 Suppose that R : 2 N -H> A 2 is the relation specified in Theorem 
\4-l\ and that the specific alternative a* £ A is described by the point p : A •<-> 1. 
// we specify vectors cand, sol : 2 N -h- 1 by 



cand = R ; (ft]p l~l p',p) sol— cand l~l S T ;cand, 

then the set{Xe2 N \Vbe A\{a*} : \{i £ X | P ii{a *, b) }\ > \{i £ X | P <l(6l «.)}|} 
is described by cand and the set of its maximum sets by sol. 

Proof. Since p describes a* , for all u £ A 2 we have (ir;p) u iff u\ = a* and p;p u 
iff «2 ^ a* . We now assume an arbitrary set X £ 2 W and calculate as follows, 



where in the fifth step Theorem 4.1 is applied 



candx ^=> R;(tv;pC\ p;p) x 

«=► ^3u£ A 2 : R x ,u^ (n;p)u A p[p n 
«=*> -i3 u £ j4 2 : i? x,u A ui = a* A ii 2 7^ a* 



A'. 



<=* Vu £ A 2 : ui = a* A u 2 7^ «* -> |{» € X | Pi, u }| > \{i E X \ P^}\ 
^VbeA:b^a*^\{iEX\ P it(a * tb) }\ > \{i £ X | P i)(6 , a .)}| 
^ V6 £ A \ {a*} : |{t £ X I P if(a ., 6) }| > |{* £ X | P il( 6 >0 .)}| 

Hence, the first claim follows from the definition of the set a vector describes. To 
prove the second claim, we take again an arbitrary set X £ 2^. Then, we get: 

solx <=>■ (cand n S T ; cand )x 
<^=> candx A S T ; cand x 
<^=> candx A ^3 Y" £ 2 N : S yx A candy 
<^=> candx A V V £ 2 W : candy — > Sy x 
«=>> candx A VF £ 2 W : candy ->• |V] < \X\ 

This equivalence implies that sol describes the set of maximum sets of voters X 
for which candx holds, that is, for which a* wins subject to the condition that 
only voters from X are allowed to vote. □ 

Using RelView we have solved our control problem with Condorcet winners 
as winning alternatives for the above input relation P and each of the eight 
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alternatives. The tool showed that only the alternatives a, b and h can made to 
Condorcet winners by deleting voters. Some of the results for these alternatives 
are presented in the following six RelView pictures: 



The vector on position 1 says that a is a Condorcet winner if all voters are 
allowed to vote and the corresponding dominance relation on position 2 is the 
original dominance relation C . To make b to a Condorcet winner at least eight 
voters must be deleted. Altogether there are 45 possibilities for this. The vector 
on position 3 shows one of them, where the voters from 1 to 6 and the voters 10 
and 11 are deleted. On position 4 the resulting dominance relation is depicted. 
To get h as Condorcet winner requires a removal of at least six voters. According 
to RelView there are 85 possibilities for this. One of them and the resulting 
dominance relation are depicted at positions 5 and 6. 

Since Condorcet winners do not always exist, choice sets have been introduced 
as a general concept that always allows to define the winners of Condorcet voting. 
In the remainder of this section we treat a well-known example, the uncovered 
set. This choice set is usually defined via an induced transitive subrelation of the 
dominance relation C, called covering relation. In the literature different such 
relations are discussed. We concentrate on a relation G : A <H- A that in [5] is 
called Gilles covering and in [5] upward covering. Its usual point-wise definition 
says that G a ,b iff C a ,b and for all c G A from C c ^ a it follows C c .b, for all a, b G A. 
This relation-algebraically can be specified as equation G = C D C T ; C . The 
(Gilles or upward) uncovered set is the set of all a G A such that there exists 
no b G A \ {a} with Gb, a - It is non-empty because G is a strict-order and A is 
finite. To the best of our knowledge, the computational complexity of control 
problems for Condorcet elections with winning conditions different from being 
a Condorcet winner has not been studied in the literature. We obtain the first 
result in this direction by proving that the problem to control Condorcet elections 
with upward covering by deleting voters is NP-hard (see Section pi). To solve our 
control problem for this specification of winners we use the same idea as in the 
relativization of the relation C to the relation R by additionally considering the 
set of voters X which are allowed to vote. The next theorem shows how to obtain 
the relativized covering relation U from the relativized dominance relation R. 

Theorem 4.3 Suppose again that R : 2 N -h- A 2 is the relation specified in The- 



Jl\ If we specify relations E : A 2 x A 2 o A 2 and U : 2 N xA 2 -o- 2 
E= [w; p r , p; p r f f) vec(n;ir T ); L U = R n [R,R];E, 
then for all X G 2 N and u G A 2 we have 



N 



by 



U x ,u *=> Rx,u AV c € A : R 



X,(c, Ml ) 



R 



X,{c,u 2 )- 
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Proof. Let X G 2 N and u G A 2 be given. In a first step we show for all v, w G A 2 
the following property, where we use the equivalence of (tt',P T ) u ,v &ud u\ — v%, 
of (p;p T ) u ,w and u 2 = W2, and of (ir; ir T ) vw and i>i = Wii 

E{v,w),u *^> ([tt; /0 T , /o; /0 T ] n vec(7r; tt t ); L)( U)U) ), u 

«=*> [7r;p T ,P;p T L, (t ,^) A (vec(7r;7r T );L) ( ^ jlo)iU 
^=^ (t; P T )«,t, A (p; p T ) u , w A vec(7r; 7r T ) ( „ iU ,) 
<^=^ u ± = v 2 A u 2 = w 2 A (ir; ir T ) v ^ w 
<^=^ ui = v 2 A u 2 = w 2 A vi = W\ 

We now can calculate as follows to conclude the proof: 



U x ,u*=*(Rn [R,Rj;E 



x. 



^=^ Rx,u A [R, R];E Xu 

^=> Rx.u A -^3 v, w G A 2 : [R, R] x ,(v,w) A E (ViW)<u 

•^=^ Rx.u A -^3 v, w G A 2 : Rx,v A R x.w A U\ = v 2 A u 2 = w 2 A v\ — W\ 

-*=^ Rx,u A^Bc <E A : Rx,(c,ux) A Rx,(c,u 2 ) 

^=^ Rx,u A Vc G A : Rx,( c ,ui) ~* Rx,(c,u 2 ) D 

After this result we are able to solve our control problem also for the uncovered 
set as set of winners. We use again a vector cand for the description of the 
candidate sets and a vector sol for the description of the sulutions. 

Theorem 4.4 Suppose that U : 2^ •<->• A 2 is the relation specified in Theorem 
\4-3\ and that the specific alternative a* G A is described by the point p : Af*l. 
If we specify vectors cand, sol :2"f*l by 



cand= U;(ir;p Ci p;p) sol = cand n S T ;cand, 

then the set {X G 2^ | -<3b E A\ {a*} : Uxib.a*)} * s described by cand and the 
set of its maximum sets by sol. 

Proof. Because a* is described by p, for all u G A 2 we have 7r; p u iff u± 7^ a* 
and (p;p) u iff u 2 = a*. Now, for all X G 2 N we can calculate as follows to show 



the first claim (for the second claim cf. the proof of Theorem 4.2 1 



candx •*=*• U;(ir;p (1 p;p) x 

«=4> n3ue A 2 : Ux.uA TTp u A(p;p) u 
4=4> ^3 u G A 2 : Ux.u A u\ ^ a* A u 2 = a* 
<=*--.3&€ A:Ux,(b, a *) Ab^a* 
<^^3beA\{a*}:U x ,(b,a*) □ 

As already mentioned, the uncovered set is always non-empty. The degenerate 
case is that no voter is allowed to vote. Then the resulting dominance relation 
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as well as the induced covering relation are empty and, thus, the uncovered set 
equals N. RelView showed that in our running example this situation occurs 
if c or d shall win. We already know that a wins without a removal of voters. By 
reason of the tool at least five voters must be deleted to ensure win for e, /, g or 
h and the corresponding numbers of possibilities are 11, 111, 15 and 126. And, 
finally, b becomes winning if at least seven voters are not allowed to vote. To 
reach the goal there exist 120 possibilities. We end this section with the following 
three RelView pictures that concern alternative e: 



I ■ ■ 



The vector shows that e is in the uncovered set if the voters 1, 2, 4, 5 and 6 
are deleted, the relation in the middle is the dominance relation resulting from 
this, and the relation on the right is the induced covering relation. The empty 
columns show that, besides e, the removal also make a, / and h uncovered. 



5 Control Remains Hard if Uncovered Alternatives Win 



As already mentioned, in [2] it is shown that for Condorcet voting constructive 
control by deleting voters is NP-hard if Condorcet winners are defined as winners. 
In this section we prove that this result remains true if instead of Condorcet 
winners the uncovered alternatives are taken. To this end we first introduce the 
following problem that we will be used in our reduction. 

Definition 5.1 The problem X4C ('exact cover by 4-sets,) is the following: 

Input: Sets Si,...,Sk G 2^ 1: ' ,n ' such that for all i G {1, . . . , k} it holds 
\Si\ =4 and\{i | jG S, t }\ = 3 for all j G {l,...,n}. 

Question: Is there some set I G 2^ 1 '' ,k ' such that [J ieI Si — {1, ...,n} and 
Si H Sj = for all i,j £ I with i^j? 

Note that if an I as required exists, then |/| = |n, since each Si has cardinality 4 
and the union must have cardinality n. On the other hand, if an / with {J ieI Si = 
{1, . . . , n} exists and |/| = ^n, then by a simple counting argument, Si n Sj — 
for all i, j G I with i =/= j. Also, the value k in the problem instance must 
necessarily be equal to |n, since each Si has 4 elements and each j € {1, . . . , n} 
appears in exactly 3 of the sets Si. In particular, it follows that n is a multiple 
of 4 in every instance fo X4C. The following result is mentioned without proof 
in [9], we give the complete proof: 

Lemma 5.1 The problem X4C is NP-hard. 
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Proof. We reduce from a special version of the l-in-3-satisfiability problem, called 
l-in-3-Sat' and introduced in [5J. An instance of l-in-3-Sat' is a formula of the 
form if = /\ i=1 l-in-3(a; l 1 , x\, x\) 1 where l-in-3(x, y, z) is a clause which is true iff 
exactly one of the variables x, y, and z is true. Additionally, p has the following 
properties: In each clause the 3 appearing variables are distinct, and each variable 
appears in exactly 4 clauses. Note that this implies that the number of distinct 
variables in c^ is |n. 

An instance ip of the problem l-in-3-Sat can be transferred into an instance 
of X4C as follows: 

— Each of the n clauses in ip becomes a set element of {1, . . . , n}, which we can 
then rename to values 1, . . . , n. 

— Each variable Xi becomes a set Si containing the clauses in which Xi appears. 

First assume that <p is satisfiable. Then there is an assignment / with I \= p. 
Since I satisfies exactly one variable in each clause, we know that n variable oc- 
currences are satisfied by /. Since each variable, in particular each of the satisfied 
variables, appears in 4 clauses, we get that jn many variables are satisfied by /. 
We can naturally interpret I as the set of indices i with I \= Xi and claim that I 
satisfies the conditions of X4C. As mentioned above, since |7| = |n, it suffices to 
show that Uiej = {1j • ■ • > n l- This follows from the construction: Since I (seen 
as a truth assignment to the variables) satisfies each clause, we know that for 
each clause, there is a variable satisfied by I. For the X4C instance, this implies 
tha for each element i € {1, . . . , n}, there is an index j £ I with i £ Sj. 

For the converse, assume that there is an index set I satisfying the conditions 
of X4C. We can interpret I as a truth assignment for the variables in ip in the 
obvious way: A variable is set to 1 iff its corresponding set is in the selection 
I. We show that I, seen as a truth assignment, satisfies the formula ip. Hence 
let l-m-3(x\, x\, x\) be a clause in (p. Since I is a set cover, we know that for 
this clause, an element containing the set element corresponding to the clause is 
selected in I. Hence I satisfies at least one of the variables x\, x\, and x\. Since 
/ is an exact cover, we also know that each set element appears only in one of 
the selected sets, hence only one of the variables is true, and we are done. □ 

We can now show the main theorem of this section. 

Theorem 5.1 For Condorcet voting the constructive control problem by deleting 
voters is NP-hard if the uncovered alternatives are specified as the winners. 



Proof. We reduce from X4C, which is NP-hard due to Lemma [5T| So, let an X4C- 
instance consisting of the sets Si, ... , Sa n be given. Without loss of generality we 
assume n > 16. From the instance, we construct an election E as follows. First we 
define t = \n — 2 (recall that in every instance to X4C, n is a multiple of 4, hence 
t is always an integer). Next we introduce alternatives a* (the alternative that 
has to win), s±, . . . , s n and b\, . . . , b n . Finally, we introduce the following four 
groups of individual preferences, where S^i = {sj | j ^ i}, B^ L = {bj | j ^ i}, 
B&i = {bj I j £ Si] and B eSi = {b 3 \ j e #}. 
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1. For each i G {1, . . . , n} we use t linear strict orders of the form S^i > Si > 
hi > B^ % > a* . 

2. For each i € {1, . . . , n} we use t linear strict orders of the form B^i > a* > 
Si > bi > Sjti. 

3. For each set Si we use a linear strict order of the form B^g. > a* > S > B^Si- 

4. We use a linear strict order of the form a* > S > B. 

The notation of preferences (linear strict orders) using sets means that the or- 
der of the alternatives inside the sets is irrelevant. For instance, a* > S > B 
means that in the linear strict order a* is the greatest element, then the alterna- 
tives si, . . . , s n follow in any order and, finally, the alternatives b\, . . . , b n follow, 
again in any order. Note that, by definition of X4C, we get I-Bgsj = n — 4 and 
\B e Si I = 4. Now, the question in our constructed instance of the control problem 
is whether the specific alternative a* can be made uncovered by deleting at most 
^n linear strict orders (i.e., voters). 

We first study the relationship between each of the relevant alternatives in 
the constructed election before any deletion of voters is performed. Note that if 
the point difference between two alternatives is at least \n + 1, then deleting at 
most |n linear strict orders cannot change which of these alternatives dominates 
the other. 

Each bi beats a* with at least |n + 1 points. To see this, we consider all 
preferences introduced in the election. For each j =/= i, the 2t linear strict 
orders of the first two groups place bi ahead of a*. From the linear strict 
orders introduced for i, one puts bi ahead of a* and the other puts a* ahead 
of bi. We now consider the linear strict orders introduced for the sets Sj: 
There are 3 sets Sj in which i appears (these place a* ahead of bi) and i 
does not appear in the remaining |n — 3 many (these place bi ahead of a*). 
Finally, a* > S > B put a* ahead of bi. Hence the lead of bi over a* is 

3 3 

(n - 1) • 2 • t + -n - 6 - 1 = 2(n - l)i + -n - 7 

v ' 4 4 

which is at least ^n + 1, since we assumed n > 16. 

Alternative a* beats each Si with at least \n + 1 points. Note that half 
of the linear strict orders introduced in the first two groups place Si ahead 
of a* and the other half put a* ahead of s^. Hence a* and Si tie in the sub- 
election consisting of these linear strict orders. In the |n linear strict orders 
introduced for the sets Si, however, a* is always placed ahead of s^. Finally, 
a* is ahead of Si in a* > S > B. As a consequence a* beats each s, with 
|n + 1 many points, which is at least \n + 1. 

If i -£ j, then bi beats Sj with at least \n + 1 points. To sec that this is 
true, note that the linear strict orders introduced in the first two groups 
are neutral between bi and Sj , as half of them have bi ahead of Sj and the 
other half have Sj ahead of bi (recall that i ^ j). Now consider the linear 
strict orders introduced for the sets Si. There are 3 such orders which place 
Sj ahead of bi (the ones corresponding to sets Si with i £ Si), and the re- 
maining |n — 3 many place bi ahead of Sj (these are the ones corresponding 
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to sets Si with i £ Si). In a* > S > B, s, is voted ahead of &,. Hence bi 
beats Sj by §n — 7 points, which is at least |n + 1, since again n > 16. 
Alternative bi beats s, with exactly |n — 3 points. This holds due to the 
following: The linear strict orders introduced in the first two groups for j =/= i 
are neutral with respect to the relationship between Sj and bi (half of them 
put Si ahead of bi, the other half put bi ahead of Sj). The 2t many linear 
strict orders introduced for i in the first two groups all put Si ahead of bi. 
Now we consider the linear strict orders introduced for the sets Sj . If i € Sj, 
then Si is ahead of bi here, this happens 3 times. In the remaining |n — 3 
linear strict orders introduced for the sets Sj , we have that i ^ Sj and hence 
in these linear strict orders, bi is ahead of s,. In a* > S > B the alternative 
Si is voted ahead of bi. Together we have that bi beats Si with 

3 3 

-2t - 3 + -n - 3 - 1 = -n - 2t - 7 

4 4 

votes. Since t = \n — 2, it follows that \n — 2t — 7 = \n — 3 as required. 

In particular, it follows that by deleting at most \n voters, the only relevant 
relationships that can be influenced arc those between bi and s, (we will see that 
the relationships between bi and bj or Si and Sj for i ^ j are not relevant). 

We now show that the reduction is correct: The instance of X4C is positive 
iff a* can be made a winner of the election using the Condorcet criterion with 
uncovered set by deleting at most \n linear strict orders. 

First, assume that the instance is positive, and let / be a corresponding index 
set. We delete the \n linear strict orders corresponding to the elements in / and 
denote the resulting election with E' . Then a* indeed is uncovered in E' . To 
show this, it suffices to prove that none of the bi covers a* , since a* wins against 
all of the Si (since a* leads against Sj with at least \n+I linear strict orders, this 
remains true also after deleting at most jn linear strict orders). Hence, assume 
that some bi covers a* in E' . It suffices to prove that Si dominates bi in E' , then, 
since a* dominates Si in E' , it follows that bi does not cover a*. Note that in 
the original election E the alternative bi beats s, with |n — 3 points. Deleting 
the |n linear strict orders corresponding to / has the following effect: 

a) For the deleted linear strict orders corresponding to sets Sj with i ^ Sj, 
the alternative Si gains a point against bi. Since i appears in exactly one of 
the chosen and \n linear strict orders are deleted, this means that Si gains 
jfi — 1 points against bi from these linear strict orders. 

b) For the single deleted linear strict order corresponding to a set Sj with i € Sj, 
the alternative Si loses a point against bi. 

Hence altogether, Si gains \n — 2 points against bi and, thus, now beats bi with 
a single point. Therefore, as claimed, bi does not cover a*. 

For the converse direction, assume that it is possible to make a* a winner 
of the election by deleting at most \n linear strict orders. Again, let E' be the 
election resulting from E by the deletions. Since the relatinship between the 
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bi's and a* cannot be changed by deleting at most jn linear strict orders and 
bi wins against a* in the original election E, all bi also win against o* in E'. 
Since a* is a winner in E 1 , it follows that for each bi there must be a alternative 
dominating bi who does not dominate a* . Since for i ^ j, we know that bi wins 
against Sj in the election E', it follows that for all relevant i the alternative S, 
wins against bi in E' . Since bi wins against s» with ^n— 3 points, it follows that 
each Si must gain at least jn— 2 points against bi by the removal of linear strict 
orders. Hence n(\n — 2) = |n 2 — In points need to be gained collectively by 
all Si against their corresponding bi. Obviously, only deleting linear strict orders 
introduced for the sets Sj helps to let Si gain points against bi. Deleting one of 
these linear strict orders gains n — 8 points (since it hurts for the 4 values of i 
with i G Sj, and helps the remaining n — 4 ones). Hence, by deleting ^n linear 
strict orders we can gain at most \n ■ (n — 8) = |n 2 — 2n points. Since this is 
the total number of points that need to be gained, we know that exactly \n 
linear strict orders are deleted to obtain the election E' , and each of these linear 
strict orders is one introduced for a set Sj . Now assume that there is some i such 
that two linear strict orders corresponding to sets Sj 1 and Sj 2 are deleted, where 
i G Sjj and i G Sj 2 and j\ ^ j 2 . Then Sj gains a point against bi in at most 
|n — 2 of the deletions and loses in at least 2 of them. Hence, S; gains at most 
|n — 4 points against bi and this implies that Si loses against bi in E' , which is a 
contradiction. Therefore, it follows that each i is contained in at most one of the 
Sj whose corresponding linear strict order is deleted. Due to cardinality reasons 
(jn linear strict orders corresponding to sets of 4 elements each are deleted), 
it follows that each i appears in exactly one set. As a consequence, we have 
obtained a set cover as required. □ 



6 Conclusion 



In this paper, we have demonstrated that the relation-algebraic approach can be 
used to solve NP-hard problems from Social Choice Theory. In particular, this 
shows how Computer Algebra tools can be used to obtain practical algorithms 
for hard problems without relying on domain knowledge for optimizations. Our 
results support the point of view that proving NP-hardness is not sufficient in 
order to conclude that a voting system is "safe" from attempts to influence the 
outcome of an election. In addition to the execution of algorithms, RelView also 
provides us with visualizations of both the input and output of the algorithms 
and some further features that support scientific experiments, like step-wise ex- 
ecution, test of properties and generation of random relations. All this makes 
the approach especially appropriate for prototyping and experimentation, and 
as such very instructive scientific research as well as for university education. 

An interesting open question is whether similar problems from the Social 
Choice literature, as for example the manipulation problem mentioned in the 
introduction, can also be solved with RelView or other Computer Algebra 
tools. 
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